Pictorial transcripts: multimedia processing applied to digital library creation

نویسندگان

Behzad Shahraray

David C. Gibbon

چکیده

This paper describes a working system for the automated archiving and selective retrieval of textual, pictorial and auditory information contained in video programs. Video processing performs the task of representing the visual information using a small subset of the video frames. Linguistic processing refines the closed caption text, generates table of contents, and creates links to relevant multimedia documents. Audio and video information are compressed and indexed based on their temporal association with the selected video frames and processed text. The derived information is used to automatically generate a hypermedia rendition of the program contents. This provides a compact representation of the information contained in the video program. It also serves as a textual and pictorial index for selective retrieval of the full-motion video program. This fully automatic system generates HyperText Markup Language (HTML) renditions of television programs, and makes them available for access over the Internet within seconds of their broadcast. This digital library currently contains over 2200 hours of television programs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition for a Digital Video Library

The standard method for making the full content of audio and video material searchable and is to annotate it with humangenerated meta-data that describes the content in a way that the search can understand, as is done in the creation of multimedia CD-ROMs. However, for the huge amounts of data that could usefully be included in digital video and audio libraries, the cost of producing this meta-...

متن کامل

The Effect of Multimedia Glosses on L2 Listening Comprehension

The present study examined the effect of multimedia glosses on foreign language listening comprehension. To this end, 94 male students studying at Rasa English Institute in Tehran were selected for the treatment. The participants consisted of three groups, and each group was randomly assigned to one of the following treatment conditions: textual, pictorial, and textual-pictorial glossing....

متن کامل

Indexing and search of multimodal information

The Informedia Digital Library Project allows full content indexing and retrieval of text, audio and video material. The integration of speech recognition, image processing, natural language processing and information retrieval overcomes limits in each technology to create a useful system. In order to answer the question how good speech recognition has to be in order to be useful and usable for...

متن کامل

MPEG-7 Pictorially Enriched Ontologies for Video Annotation

A system for the automatic creation of Pictorially Enriched Ontologies is presented, that is ontologies for context-based video digital libraries, enriched by pictorial concepts for video annotation, summarization and similarity-based retrieval. Extraction of pictorial concepts with video clips clustering, ontology storing with MPEG-7, and the use of the ontology for stored video annotation are...

متن کامل

Intelligent Content-Based Audio Classification and Retrieval for Web Applications

Content-based technology has emerged from the development of multimedia signal processing and wide spread of web application. In this chapter, we discuss the issues involved in the content-based audio classification and retrieval, including spoken document retrieval and music information retrieval. Further, along this direction, we conclude that the emerging audio ontology can be applied in fas...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1997

Pictorial transcripts: multimedia processing applied to digital library creation

نویسندگان

چکیده

منابع مشابه

Speech Recognition for a Digital Video Library

The Effect of Multimedia Glosses on L2 Listening Comprehension

Indexing and search of multimodal information

MPEG-7 Pictorially Enriched Ontologies for Video Annotation

Intelligent Content-Based Audio Classification and Retrieval for Web Applications

عنوان ژورنال:

اشتراک گذاری